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(57) Abstract 

Genetic material encoding luciferase from the marine coclenterate Renilla has been isolated and characterized. This genet- 
ic material allows the production of peptides for use as labels in bioluminescence assays or can itself be directly used to identify 
luciferase genes from related organisms.' " ~ 
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BACKGROUND OF THE INVENTION 

Field of the Invention 
10 This invention relates to the field of genetic 

engineering and is particularly related to the 
expression of proteins by techniques involving genetic 
engineering, 

15 Description of the Background 

The Renilla, also known as sea pansies r belong 
to a class of coelenterates known as the anthozoans. 
In addition to Renilla , other representative 
bioluminescent genera of the class Anthozoa include 
Cavarnularia, Ptilosarcus , Stvlatula , Acanthoptilum . 
and Parazoanthus . All of these organisms are 
bioluminescent and emit light as a result of the action 
of an enzyme (luciferase) on a substrate (luciferin) 
under appropriate biological conditions. Prior studies 
25 have demonstrated that all of the above-mentioned 
anthozoans contain similar lucif erases and 
luciferins. See, for example, Cormier et al., J. Cell, 
Physiol. (1973) 81: 291-298. The lucif erases and 
luciferins from each of these anthozoans will cross- 
react with one another to produce the characteristic 
blue luminescence observed in Renilla extracts. Each 
of these luciferases has similar biochemical 
properties, and the biochemical requirements for 
bioluminescence are identical regardless of the 
35 anthozoan from which the luciferase was derived. 

There has been considerable interest of late 
in replacing radioactive labels used in analytical 
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assays with other types, such as luminescent labels. 
Firefly luciferase, which is a molecule of 
significantly different structure that does not react 
with Renilla -like luciferins, is one molecule that has 
5 been proposed for use as such labels. However, firefly 
luciferase suffers from a number of deficiencies that 
make this molecule less than optimal in biological 
assays. For example, ATP acts as a trigger of the 
firefly luciferase system, and the ubiquitous nature of 
10 ATP makes control of this variable difficult. 

A prior patent application by one of the 
present inventors, U.S. Patent Application Serial No. 
059,137, filed June 5, 1987, describes use of 
coelenterate-derived lucif erases and photoproteins as 
15 bioluminescent labels. Other applications by the same 
inventor, for example, U.S. Application Serial Nos. 
173,045, filed March 17, 1988, and 165,422, filed 
February 29, 1988, describe recombinant DNA capable of 
expressing the photoprotein apoaeguorin. 

The photoprotein aeguorin (which consists of 
apoaeguorin bound to a coelenterate luciferin molecule) 
and Renilla luciferase both utilize the same 
coelenterate lucif erin, and the chemistry of light 
emission in both cases has been shown to be the same. 
However, aequorin luminescence is triggered by calcium, 
does not require dissolved oxygen, and represents a 
single turnover event. In contrast, Renilla luciferase 
is not triggered by calcium and requires dissolved 
oxygen in order to produce light in the presence of 
coelenterate luciferin. Renilla luciferase also acts 
as a true enzyme, catalyzing a long-lasting 
luminescence in the presence of saturating levels of 
luciferin. 
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Sub-attomole levels of aequorin can be 
detected with photometers even though its luminescence 
represents a single turnover event. Renilla 
luciferase, because of its enzymatic ability, should be 
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detectable at levels 1 to 2 orders of magnitude lower 
than aequorin. Furthermore, Renilla luciferase is 
known to be relatively stable to heat, an important 
consideration for assays that often involve incubation 
5 at physiological temperatures. Accordingly, Renilla 

luciferase is a potentially useful label for biological 
and other assays. 

On the other hand, Renilla live on the ocean 
bottom, about 30 to 100 feet deep, and must be 
10 collected by dregging. From 1 kg of Renilla (about 
1000 animals) , approximately 1 mg of pure Renilla 
luciferase can be obtained following a tedious 
procedure which requires purifying the protein about 
12,000 fold. The purification procedure is described 
in Matthews et al. , Biochemistry (1977) 16: 85-91. As 
a result, there has been no development of Renilla 
luciferase as a detectable label. 

Accordingly, improved techniques for the 
production of pure Renilla luciferase are necessary 
before this molecule can be used commercially in 
bioluminescence assays. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
The invention will be better understood by 
reference to the following detailed description and 
examples and the attached Figures that form part of the 
present specification, wherein: 

Figure 1 is the nucleotide sequence of a clone 
that contains a Renilla reniformis luciferase cDNA 
30 sequence. 

Figure 2 is the amino acid sequence derived 
from the open reading frame of the Renilla luciferase 
cDNA shown in Figure 1. 

Figure 3 is the recombinant luciferase amino 
acid sequence with different types of underlining to 
show the location of peptides obtained by digestion of 
native luciferase with V-8 protease. 
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Figure 4 is a table showing the amino acid 
sequence of Renilla reniformis peptides obtained by V-8 
protease digestion. Regions of low degeneracy that 
were selected for preparation of oligonucleotide probes 
are shown by boxes. The probes are shown at the bottom 
part of the Figure. 

Figure 5 is a schematic representation of a 
restriction enzyme map for Renilla luciferase cDNA. 
The lower portion of Figure 5 is a schematic 
representation of sequencing strategy for Renilla 
luciferase cDNA. 

Figure 6 is a map of a Renilla luciferase 
expression plasmid. 

Figure 7 is a schematic diagram of the control 
15 region of the plasmid pTZRLuc-1. 

Figure 8 is a schematic diagram of the 
purification scheme used to purify recombinant 
luciferase. 
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20 SUMMARY OF THE INVENTION 

The present invention provides genetic mater- 
ial encoding Renilla luciferase. The genetic material 
can be used to produce the enzyme for use as 
luminescent tags in bioluminescence assays and for 
25 other purposes for which such labels are desirable. 
Additionally, the genetic material can be used as a 
source of probes that can be used in nucleic acid 
hybridization assays for the identification of other 
luciferase genes from related organisms. Fragments of 
the enzyme can be used to prepare antibodies for the 
purpose of identifying luciferase genes from related 
organisms. Specific genetic materials and luciferase 
proteins are disclosed in the following detailed 
description and examples. 
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DESCRIPTIO N OF SPECIFIC EMBODIMENTS 
The present inventors have identified and 
obtained for the first time genetic material encoding 

luciferase from the coelenterate genus Reniiia which 
previously has been available only in limited 
quantities, since luciferases have a number of uses as 
a bioluminescent label and since Renilla luciferase has 
a number of properties that make it particularly useful 
as a label, availability of the enzyme in significant 
quantities in pure form provides a significant 
commercial advantage over prior sources. The Renilla 
genetic material also provides a source of nucleic acid 
probes for use in hybridization techniques that allow 
location of luciferase genes in related organisms. The 
CDNA sequence for a clone that contains a Renilla 
reniformis luciferase gene is set forth in Figure 1, 
with the translated cDNA amino acid sequence being set 
forth in Figure 2. The coding sequence of the clone in 
Figure 1 begins at nucleotide 10 and continues to a 
stop codon at nucleotide 944. Figure 3 shows a 
complete recombinant Renilla luciferase amino acid 
sequence as produced by an expression system. 

The present invention has specifically 
contemplated each and every possible variation of 
polynucleotide that could be made by selecting 
combinations based on the possible codon choices listed 
in Figure l (with the reading frame beginning at 
position 1 of Figure 1) and in Table 1 (below), and all 
such variations are to be considered as being 
specifically disclosed and equivalent to the sequence 
of Figure 1. Codons are preferably selected to fit the 
host cell in which the enzyme is being produced. 
Selection of codons to maximize expression of proteins 
in a heterologous host is a known technique. 

Other DNA molecules that code for such pep- 
tides can readily be determined from the list of codons 
in Table 1 and are likewise contemplated as being equi- 
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valent to the DNA sequence of Figure 1. m fact, since 
there is a fixed relationship between DNA codons and 
amino acids in a peptide, any discussion in this appli- 
cation of a replacement or other change in a peptide i* 
equally applicable to the corresponding DNA sequence or 
to the DNA molecule, recombinant vector, or transformed 
microorganism in which the sequence is located (and 
vice versa) . 
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Alanine (Ala, A) 
Arginine (Arg, R) 
Asparagine (Asn, N) 
Aspartic acid (Asp, D) 
Cysteine (Cys, C) 
Glutamine (Gin, Q) 
Glutamic acid (Glu, E) 
Glycine (Gly, G ) 
Histidine (His, H) 
Isoleucine (lie, I) 
Leucine (Leu, L) 
Lysine (Lys, K) 
Methionine (Met, M) 
Phenylalanine (Phe, P) 
Proline (Pro, P) 
Serine (Ser, S) 
Threonine (Thr, T) 
Tryptophan (Trp, W) 
Tyrosine (Tyr, Y) 
Valine (Val, V) 
Termination signal 



GCA, GCC, GCG, GCT 

AG A, AGG, CGA, CGC, CGG, CGT 

AAC, AAT 

GAC, GAT 

TGC, TGT 

CAA, CAG 

GAA, GAG 

GGA, GGC, GGG, GGT 
CAC, CAT 
ATA, ATC, ATT 

CTA, CTC, CTG, CTT, TTA, TTG 

AAA, AAG 

ATG 

TTC, TTT 

CCA, CCC, CCG, CCT 

AGC, AGT, TCA, TCC, TCG, TCT 

ACA, ACC, ACG, ACT 

TGG 

TAC, TAT 

GTA, GTC, GTG, GTT 
TAA, TAG, TGA 



nf Y L^ ch . 3 " le "er triplet represents a trinucleotide 
rfoh? h ^K ln ? a 5 end on the left and a 3' end on tht 

£™*f Th - le ^ GrS Stand for the P" in * or pJriSdine 
bases forining the nucleotide sequence: A = lden*ne 

L^ 9 - an ^ e ' ° = c y tosi *e, and T = thymine. The rSA 
code is the same except that U (uracil) replace! T 
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In addition to the specific nucleotides listed 
« Figure 1, DNA (or corresponding RNA) molecules of 
the invention can have additional nucleotides precedino 
or following those that are specifically listed. Por' 
example, poly A can be added to the 3 '-terminal- a 

T/l T" 9 -" f6Wer tha " 20 nUcleotides > -quence can be 
added to exther terminal to provide a terminal sequence 
corresponding to a restriction endonuclease site, stop 
codons can follow the peptide sequence to terminate 
translation, and the like. Additionally, DNA molecules 
containing a promoter region or other control region 
upstream from the gene can be produced. All DNA mole- 
cules containing the sequences of the invention win be 

is b7f 1SaSt PUrp ° Se Since 311 «» 

15 be fragmented to produce oligonucleotide p ro bes and be 
used m the isolation or detection of DNA from biologi- 

cal sources. y 

A number of words used in this specification 
20 ; aV6 . SPeCifiC meanin ^ *" Edition to their more common 
20 meanings. "Renilla luciferase" means the lucif erase 
enzyme isolated ftom a member of the genus Renilla or 
an equivalent molecule obtained from any otn7T^u~ rce 
or synthetically. By "equivalent" is meant, when 
referring to two nucleotide sequences, that the two 
nucleotide sequences in question encode the same 
sequence of amino acids. When "equivalent" is used in 
referring to two peptides, it means that the two 
peptides will have substantially the same amino acid 
sequence. When "equivalent" refers to a property, the 
30 property does not need to be present to the same extent 
(e.g., two peptides can exhibit different rates of the 
same type of enzymatic activity) , but the properties 
are preferably substantially the same. 
"Complementary," when referring to two nucleotide 
sequences, means that the two sequences are capable of 
hybridizing, preferably with less than 25%, more 
preferably with less than 15%, even more preferably 
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with less than 5%, most preferably with no mismatches 
between opposed nucleotides. Preferred hybridizing 
conditions (which are not limited to specific numbers 
of mismatches, are set forth in the Examples. The term 
"substantially" varies with the context as understood 
by those skilled in the relevant art and generally 
means at least 70%, preferably means at least 80%, more 
preferably at least 90%, and most preferably at least 
95%. The phrase "substantially identical" includes 

complete identity as well as less than complete 
identity (e.g., of amino acid sequences or enzymatic 
activity) as established by the prior definition of 
"substantially." The term "isolated" as used herein 
refers to, e.g., a peptide, DNA, or RNA separated from 
other peptides, DNAs, or RNAs , respectively, and being 
found in the presence of (if anything, only a solvent, 
buffer, ion or other component normally present in a 
biochemical solution of the same. "Isolated" does not 
encompass either natural materials in their native 
state or natural materials that have been separated 
into components (e.g., in an acrylamide gel, but not 
obtained either as pure substances or as solutions. 
The phrase "replaced by" or "replacement" as used 
herein does not necessarily refer to any action that 
must take place but to the peptide that exists when an 
indicated "replacement" amino acid is present in the 
same position as the amino acid indicated to be present 
in a different formula (e.g., when leucine instead of 
valine is present at amino acid 11). 

Since the DNA sequence of the Renilla lucif- 
erase gene has been identified, it is possible to 
produce a DNA gene entirely by synthetic chemistry, 
after which the gene can be inserted into any of the 
many available DNA vectors using known techniques of 
recombinant DNA technology. Thus, the present inven- 
tion -can be carried out using reagents, plasmids, and 
microorganisms which are freely available and in the 
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public domain at the time of filing of this patent 
application without requiring a deposit of genetic 
material . 

For example, nucleotide sequences greater thar 
100 bases long can be readily synthesized on an Applied 
Biosystems Model 380A DNA Synthesizer as evidenced by 
commercial advertising of the same (e.g.. Genetic " 
Engineering News, November /December 1984, p. 3) Such 
oligonucleotides can readily be spliced using, among 
10 others, the technique of preparing overlapping comple- 
mentary sequences (e.g, 1-100 of coding strand, 0-50 
and 51-150 of complementary strand, 101-200 of coding 
strand, etc.), followed by hybridizing and ligating the 
strands. Such techniques are well known and are 
15 described in detail in, for example, Davis et al. , 

Basic Methods in Molecular n^i^ ,, Elsevier Scie ' nce 
Publ. Co., inc.. New York (1986). The peptides can 
then be expressed in a host organism as described 
herein. 
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Furthermore, automated equipment is also 
available that makes direct synthesis of many of the 
peptides disclosed herein readily available, especially 
peptide fragments of less than the entire Renilla 
luciferase enzyme. In the same issue of Genetic 
Engineering News mentioned above, a commercially 
available automated peptide synthesizer having a 
coupling efficiency exceeding 99% is advertised (page 
34). such equipment provides ready access to the 
peptides of the invention, either by direct synthesis 
or by synthesis of a series of fragments that can be 
coupled using other known techniques. 

In addition to the specific polypeptide se- 
quence shown in Figures 2 and 3, peptide fragments 
based on these sequences and fragments and full length 
sequences representing minor variations thereof will 
have at least some of the biological activities of 
luciferase and will therefore be useful in appropriate 
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circumstances. For example, fragments of the 
luciferase enzyme sequence can readily be prepared and 
can -be screened for use as luciferin binding site 
models. Peptide synthesizers can be used to prepare 
5 small polypeptide fragments (e.g., less than 100 amino 
acids) or techniques of genetic engineering can be used 
to prepare larger fragments. A simple screening - 
procedure that will identify suitable polypeptide 
fragments consists of attaching a suitable substrate, 
e.g., a coelenterate luciferin molecule, to an affinity 
column and capturing peptide fragments that are 
retained by the bound substrate. Such peptides can 
also be used (and are indeed more likely to be used) as 
immunogens for the preparation of antibodies that can 
be used to screen for the expression of a luciferase by 
a genetically engineered organism, in which case the 
bound substrate will be an antibody or similar molecule 
that binds specifically to Renilla luciferase. 

The ability to prepare and select peptide 
fragments having appropriate binding affinity from a 
larger protein is well known in the art and is 
described in a number of publications, including 
patents. See, for example, U.S. Patent No. 4,629,783, 
which describes the preparation of immunologically 
active fragments of viral proteins that bind with the 
same antibodies as the entire viral protein. 

In addition, minor variations of the previous- 
ly mentioned peptides and DMA molecules are also con- 
templated as being equivalent to those peptides and DNA 
molecules that are set forth in more detail, as will be 
appreciated by those skilled in the art. For example, 
it is reasonable to expect that an isolated replacement 
of a leucine with an isoleucine or valine, an aspartate 
with a glutamate, a threonine with a serine, or a simi- 
lar replacement of an amino acid with a structurally 
related amino acid (i.e., a conservative replacement) 
will not have a major effect on the biological activity 
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of the resulting molecule, especially if the replace- 
ment does not involve an amino acid at a binding site 
or other site of biologic activity. Furthermore, 
additional amino acids can be present at either of the 
5 two termini, or amino acids can be absent from one or 
both of the termini, as is known in the art. 

Whether a change results in a functioning- pep- 
tide can readily be determined by direct analysis for 
function in a assay that relies on ability of the 
10 modified enzyme (or fragment) to carry out the normal 
function of the natural lucif erase enzyme (or 
fragment). For example, modified peptides can be 
tested for ability to catalyze the emission of light 
from coelenterate luciferin by the same techniques 
described below for the recombinant Renilla luciferase 
molecule. Peptides in which more than one replacement 
has taken place can readily be tested in the same 
manner. Preferred peptides differ at no more than 12 
more preferably no more than 5, amino acids in any 
contiguous group of 20 amino acids. Substitutions of 
amino acids, when they occur, are preferably from 
within standard conservative groups. Standard 
conservative groups of amino acids are shown in 
parenthesis using the one-letter amino acid code: non- 
polar (A,V,L,I r P, M ); aromatic (F,T,W); uncharged polar 
(G,S,T,C,N,Q); acidic (D,E); basic <K,R,H) . The aroma- 
tic amino acids are sometimes considered to belong to 
the broader-defined nonpolar (F,W) or uncharged polar 
(T) groups. 
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Salts of any of the peptides described herein 
will naturally occur when such peptides are present in 
(or isolated from) aqueous solutions of various pHs. 
All salts of peptides having the indicated biological 
activity are considered to be within the scope of the 
present invention. Examples include alkali, alkaline 
earth, and other metal salts of carboxylic acid resi- 
dues, acid addition salts (e.g., HC1) of amino resi- 
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dues, and zwitterions formed by reactions between 
carboxylic acid and amino residues within the same 
molecule . 

Although genes and corresponding proteins can 
be prepared by the totally synthetic techniques dis- 
cussed above, in preferred embodiments of the invention 
genetic information is obtained from natural sources 
and identified as described herein. The genetic mater- 
ial is first obtained in the form of a gene library, 
using any of numerous existing techniques. The first 
of these is to randomly shear genomic DNA and insert 
this sheared material into expression vectors, if 
enough recombinants are generated, there is a good 
probability of having at least one recombinant in the 
population which is expressing a fusion protein corre- 
sponding to the enzyme of interest. 

Another strategy for preparing gene libraries 
is to make complementary DNA (cDNA) copies of the total 
mRNA population of the organism and to clone these as 
recombinant molecules in expression vectors. The 
expected nature of the organism (i.e., it was expected 
to have the characteristics of a eucaryote) indicated 
that introns might be present within the coding region 
of the desired gene. Although introns do not preclude 
use of sheared genomic DNA, they increase the number of 
recombinants which must be screened and make further 
analyses substantially complicated. Based on this 
result, use of a cDNA library to obtain Renilla genes 
is preferred. 

Such a library was generated in the laboratory 
of the inventors and screened for expression of a gene 
product having luciferase activity. Details of this 
example are set forth below, including details of the 
experiments that lead to obtaining the complete 
35 sequence of the gene. However, there is no reason to 
believe that the sequence and specific engineered 
organism prepared by the inventors is any better than 
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other clones that can be prepared using the guidance 
set forth in this specification. In fact, it is likely 
that expression of Renilia luciferase can be enhanced 
over that described herein by selection of other 
5 expression systems, as discussed in the examples below. 

Now that the sequence of Renilia luciferase 
has been determined, it is no longer necessary to go 
through these steps to obtain the genetic material of 
the present invention. The polymerase chain reaction 
10 (PCR) technique can now be used to isolate genes from 
•natural sources in a simpler and more direct manner, 
^he PCR technique, including use in diagnosis, is 
disclosed in U.S. Patent 4,683,202, which is herein 
incorporated by reference, since Renilia specimens are 
readily available from the oceans of the world, and 
since PCR probes can be prepared using the sequences 
set forth in this specification, it is possible to 
obtain any desired segment of the sequences set forth 
herein using the PCR technique and naturally available 
sources of Renilia genomic material. A specific 
example of such a technique for isolating the Renilia 
luciferase chromosomal gene is described in the 
examples that follow. The cloned gene can then be 
inserted into commercial vectors and expressed. 

Although the techniques set forth above, when 
used in combination with the knowledge of those skilled 
m the art of genetic engineering and the previously 
stated guidelines, will readily enable isolation of the 
desired gene and its use in recombinant DNA vectors now 
that sufficient information is provided to locate the 
gene, other methods which lead to the same result are 
also known and may be used in the preparation of recom- 
binant DNA vectors of this invention. 

Expression of Renilia protein can be enhanced 
by including multiple copies of the gene in a trans- 
formed host; by selecting a vector known to reproduce 
in the host, thereby producing large quantities of pro- ' 
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tein from exogeneous inserted DNA (such as pUC8 • 
Ptacl2; pIN-III-om P Al, 2, or 3; pOTS; pASl; or pKK223- 
3); or by any other known means of enhancing peptide 

expression, 

5 One common variation is the preparation of a 

polypeptide of the invention in the form of a fused 
polypeptide. Such peptides are typically prepared by 
using the promoter region of a gene known to be ex- 
pressed in a host and inserting nucleotides that encode 
all or a major portion of the amino acid sequence of 
the invention into the genetic sequence for the host 
protein. Examples of such fused proteins include fi- 
galactosidase fused proteins, if desired, the fused 
peptide can be designed so that a site recognized by a 
proteolytic enzyme is present at the junction between 
the two fused proteins. The proteolytic enzyme can 
then be used to cleave the expressed protein so that 
the desired luciferase enzyme is available in pure 
form. 
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In all cases, a Renilla luciferase will be ex- 
pressed when the. DNA sequence is functionally inserted 
mto the vector. By "functionally inserted" is meant 
m proper reading frame and orientation, as is well 
understood by those skilled in the art. Typically a 
gene will be inserted downstream froma promoter and 
will be followed by a stop codon, although production 
as a hybrid protein (possibly followed by cleavage) may 
be used, if desired. 

In addition to the above general procedures 
which can be used for preparing recombinant DNA mole- 
cules and transformed unicellular organisms in accor- 
dance with the practices of this invention, other known 
techniques and modifications thereof can be used in 
carrying out the practice of the invention, m partic- 
ular, techniques relating to genetic engineering have 
recently undergone explosive growth and development 
Many recent u.S._patents disclose plasmids, genetically 
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engineering microorganisms, and methods of conducting 
genetic engineering which can be used in the practice 
of the present invention. For example, U.S. Patent 
4,273,875 discloses a plasmid and a process of isolat- 
5 ing the same. U.S. Patent 4,304,863 discloses a pro- 
cess for producing bacteria by genetic engineering in 
which a hybrid plasmid is constructed and used to - 
transform a bacterial host. U.S. Patent 4,419,450 
discloses a plasmid useful as a cloning vehicle in re- 
10 combinant DNA work. U.S. Patent 4,362,867 discloses 
recombinant cDNA construction methods and hybrid nuc- 
leotides produced thereby which are useful in cloning 
processes. U.S. Patent 4,403,036 discloses genetic 
reagents for generating plasmids containing multiple 
15 copies of DNA segments. U.S. Patent 4,363,877 disclo- 
ses recombinant DNA transfer vectors. U.S. Patent 
4,356,270 discloses a recombinant DNA cloning vehicle 
and is a particularly useful disclosure for those with 
limited experience in the area of genetic engineering 
since it defines many of the terms used in genetic 
engineering and the basic processes used therein. US 
Patent 4,336,336 discloses a fused gene and a method of 
making the same. U.S. Patent 4,349,629 discloses plas- 
mid vectors and the production and use thereof, u S 
Patent 4,332,901 discloses a cloning vector useful 'in 
recombinant DNA. Although some of these patents are 
directed to the production of a particular gene product 
that is not within the scope of the present invention, 
the procedures described therein can easily be modified 
to the practice of the invention described in this 
specification by those skilled in the art of genetic 
engineering. 

The implications of the present invention are 
significant in that useful amounts of Renilla 
luciferase and genetic material of the invention will 
become available for use in the development of hybridi- 
zation assays or in any other type of assay utilizing 
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these materials. Transferring the Renilla lucif erase 
cDNA which has been isolated to other expression 
vectors will produce constructs which improve the ex- 
pression of luciferase in E. coli or express the 
5 polypeptide in other hosts. 

Particularly contemplated is the isolation of 
genes from related organisms using oligonucleotide 
probes based on the principal and variant nucleotide 
sequences disclosed herein. Such probes can be 
10 considerably shorter than the entire sequence but 
should be at least 10, preferably at least 14, 
nucleotides in length. Intermediate oligonucleotides 
from 20 to 500, especially 30 to 200, nucleotides in 
length provide particularly specific and rapid-acting 
15 probes. Longer oligonucleotides are also useful, up t 
the full length of the gene. Both RNA and DNA probes " 
can be used. 

In use, the probes are typically labelled in a 
detectable manner (e.g., with 32 p# 3 Hf biotin , Qr av -_ 
20 dm) and are incubated with single-stranded DNA or RNA 
from the organism in which a gene is being sought. 
Hybridization is detected by means of the label after 
single-stranded and double-stranded (hybridized) DNA 
(or DNA/RNA ) have been separated (typically using 
25 nitrocellulose paper,. Hybridization techniques suit- 
able for use with oligonucleotides are well known. 

Although probes are normally used with a 
detectable label that allows easy identification, 
unlabeled oligonucleotides are also useful, both as 
precursors of labeled probes and for use in methods 
that provide for direct detection of double-stranded 
DNA (or DNA/RNA) . Accordingly, the term "oligo- 
nucleotide probe" refers to both labeled and unlabeled 
forms . 
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In summary, the inventors have reduced the 
present invention to practice by isolating and 
sequencing a cDNA clone for Renilla reniformis 
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luciferase. The deduced amino acid sequence from this 
cDNA, beginning at the first methionine residue, 
predicts a protein of M r equal to 36 kd, which is the 
approximate size of native Renilla luciferase. The 
deduced amino acid sequence also contains within it ai: 
six peptide sequences from V-8 protease-digested native 
Renilla luciferase. Only one mis-match was found - 
between these two sets of amino acid data, a 
substitution of a tryptophan for a leucine present in 
the peptide sequence. Comparisions of the native amine 
acid composition and the predicted recombinant 
luciferase composition reveal a very high degree of 
similarity with many identities between specific amino 
acid residues. 

Additionally, expression of luciferase in a 
genetically engineered organism has been 
demonstrated. Luciferase activity was found in crude 
extracts of the original luciferase clone XRLuc-6. 
Subcloning the cDNA into the vector pTZ18R increased 
20 this activity enough to allow the purification of 
recombinant luciferase from the pTZRLuc-1 cells. 
Recombinant luciferase can be purified by a much 
simplified method from that previously used in the 
purification of native luciferase. The recombinant 
25 luciferase functions identically to native luciferase 
in all aspects analysed thus far. Like native, 
recombinant luciferase has an emission spectrum with a 
Xmax at 480 nm and a shoulder at 400 nm. The 
absorption spectrum of recombinant luciferase is also 
30 identical to that of native. Additionally, both native 
and recombinant luciferase are very stable at 37«c for 
several hours as well as having significant stability 
at 45«C. Using the specific activity determined for 
native luciferase, protein determinations made based on 
light emission correlate very well with A 280 and Lowry 
protein determinations, suggesting that the specific 
activity of recombinant luciferase is similar to, if 
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not the same as, that of native luciferase. Finally, 
araino-terrainus amino acid sequence analysis of 
recombinant luciferase shows an identical sequence to 
that of the cDNA-predicted amino acid sequence from 
5 residues 2 through 18. A significant amount of the 
recombinant protein is blocked at the amino terminus, 
probably by N-formyl methionine, which accounts for the 
inability to determine the amino acid at residue 1. 

The invention now being generally described, 
the same will be better understood by reference to the 
following examples which are provided for purposes of 
illustration only and are not to be considered limiting 
of the invention unless so specified. 
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EXAMPLES 



Assay for Luciferase Activity 

Crude supernatants or pure recombinant 
luciferase samples {10 to 100 „l} were added to 1 ml of 
luciferase assay buffer {0.5M NaCl, 0.1M KP0 4 pH 7.6, 
lmM EDTA, 0.02% BSA, and 0.004% NaNg} and vortexedin 
12 X 75 mm test tube. Synthetic benzyl luciferin {lOul 
of a 2.5 nmol/pl stock} was added to the reaction to 
give a final concentration of 2.5 X 10~ 8 M, and the 
mixture was vortexed rigorously for 4-5 sec. The tube 
was placed immediately in a Turner Model TD-20e 
luminometer and peak light emission was determined and 
converted to photons using a 63 Ni radioactive light 
emission calibration standard. 

RNA Isolation and cDNA Synthesis 

Live Renilla reniformis were collected by 
bottom trawling in shallow waters off Sapelo Island in 
the state of Georgia at the University of Georgia 
Marine Institute. The animals were washed thoroughly 
in fresh seawater, quick frozen in liquid nitrogen, and 
stored at -80-C. Frozen Renilla were crushed to a fine 
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powder under l iquid nit rogen with a morter and 
pestle. The powdered tissue was then homogenized with 
a Warxng blender in 4 M guanidine thiocyanate, and 
total RNA was isolated as described in Chirgwin et al 
5 Biochemistry (1970) 18:5294-5299. Total RNA wasThe^' 
passed over an oligo-dT cellui ose column to obtain 
polyadenylated RNA which was stored as an ethanol - 
precipitate at -20-C. Single and double stranded cDNA 
were synthesized from poly A + RNA by modification of 
the Gubler and Hoffman method, Gubler et al., Gene 
(1983, 25:263-269, as described below. Following 
polymerase blunting and methylation of the cDNAs 
synthetic EcoRI linkers were blunt-end ligated. After 
digestion with EcoRI , the excess linkers were separated 
from the cDNAs by low-melt agarose gel electrophor- 
esis, only cDNA's greater than about 650 bp in length 
were isolated from the low melt gel. 

Construction and Screening of tne X(? tl1 Mh 

Purified cDNA's were ligated into EcoRl- 
digested Agtll. The DNA was then packaged using x 
Phage extracts (Gigapack Plus Kit, Strategene, 
Several fractions of the packaged library were titered 
xn Y1088 cells; these fractions ranged from 71% to 81% 
recombinant phage as determined by the lack of iptg- 
xnducible 8-galactosidase activity. The total number 
of recombinant phage was equal to 2.1 x 10* pfu (plaque 
forming units,. The primary library was then amplified 
xn Y1088 cells and stored in 7% DMSO at -80"C The 
30 titer of the amplified library was 2.5 X 10? pfu /ml and 
was approximately 65% recombinant. 

Two 17-base oligonucleotide probes were- 
synthesized based on amino acid sequence data from 
isolated peptides derived from V-8 protease digested 
35 native Renilla luciferase. Shown in Figure 4 are the 
ammo acid sequences of the seven V-8 luciferase 
peptides. The amino acid sequences with the lowest 



25 



OCIO: <WO S21S673A1> 



WO 92/15673 PCT/US91/016I4 

21 

codon redundancy were selected for synthesis of 
luciferase oligonucleotide Probe #1 and Probe #2, which 
are shown highlighted with their derived nucleotide 
sequences (lower portion of Figure 4). Pro be #1 was 
5 derived from peptide 7 and contained 32 redundancies 
while Probe # 2/ derived from peptide i, contained 64 ' 
redundancies. The probes were end-labeled with T>-4 
polynucleotide kinase to high specific activity {4-9 x 
10 cpm/ug}. yi088 cells were infected with enough 
10 phage to give 3 X io« pfu/plate. The infected cells 

were plated in 6 ml of top agarose onto 150 mm diameter 
Luna plates containing 50 „g/ml ampicillin. After 
overnight incubation at 37°C, the plates were chilled 
at 4»c before performing plaque lifts. To eliminate 
15 false positive signals, duplicate nitrocellulose filter 
plaque replicas were prepared from each master plate 
Filters were processed by base treatment followed by 
neutralization in Tris buffer. 

The filters were air dried and baked at 80°C 
20 in vacuo. Prehybridization was for at least 6 hours 
37-C in 6X SSC, 50 mM Sodium Phosphate (pH 6.8), 5X 
Denhardfs, and 100 ug/ml denatured Herring sperm 
DNA. Hybridization was overnight at 37»C in 
prehybridization solution with the addition of dextran 
25 sulfate to a final concentration of 10%. The labeled 
probes were added to the hybridization solution at 1-2 
X 10 b cpm/ml. 

Filter washes were done in the presence of 
tetramethylammonium choride under the conditions 

30 described for a 17-base oligonucleotide in Wood et al. , 
Proc. Nat. Acad. Sci. USA (1985) 82:1585-1588. Each 
duplicate filter was hybridized to both probes - in the 
first round of screening; in subsequent rounds, the 
duplicate filters were hybridized to either Probe #1 or 

35 Probe #2. All cDNA clones were plaque purified after 
three or four rounds of screening; phage DNA was 
isolated from each clone on glycerol step gradients as 
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described in Grossberger, D. , Nuc. Acid. Res. (1987) 
15(16) :6737. 

DNA Sequence Analysis 

All DNA sequence analysis was done in the M13 
vectors mpl8 and mpl9. Single stranded templates were 
prepared and dideoxynucleotide sequencing was preformed 
using a Sequenase DNA Sequencing Kit obtained from 
United States Biochemical Corporation. Sequencing 
reactions were primed using either the M13 universal 
primer, a primer which hybridized to extraneous xgtll 
DNA present in some constructs, or the oligonucleotide 
probes. Sequence data obtained from both ends of the 
cDNA was analyzed for six base restriction enzyme sites 
which were used to generate sequencing subclones 
(Figure 5). In this way, the entire 1.2 kb cDNA was 
sequenced on both strands (lower portion of Figure 
5). All DNA sequences and translated protein sequences 
were assembled and analyzed using MicroGenie Sequence 
Software purchased from Beckman. 



Expression in E. coli 

The initial lucif erase cDNA clone, XRLuc-6, 
was in the expression vector xgtll. The clone was 
25 ampliflied in Y1088 cells and the high titer stock was 
used to make lysogens in ¥1089. The xRLuc-6 lysogen 
was then grown in Luria broth plus ampicillin (50 
ug/ml) at 37°C. The cells were pelleted, resuspended 
in TE buffer, and lysed with lysozyme (2 mg/ml). The 
cell debris was then pelleted and the supernatant was 
assayed for lucif erase activity. The 2.2kbp XRLuc-6 
insert which included .1 kb of xgtll lacZ DNA attached 
to the 3' end was isolated on a low-melt gel and 
subcloned into the EcoRI/SstI sites of ,pTZ18R 
35 (Pharmacia). This construct, pTZRLuc-1, was used in 

the expression and purification of recombinant Renilla 
lucif erase. 
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Electrophore tic and Western Analy sis 

Recombinant luciferase samples were 
characterized on Commassie-stained SDS-PAGE gels. For 
western analysis, the gels were run and transferred to 
nitrocellulose filters at 30 mA in transfer buffer as 
described in Burnett, N.W., Analytical Biochemistry 
(1981) 112:195-203. The filters were blocked with 3% 
BSA and incubated with a 1/1000 dilution of polyclonal 
rabbit-anti-lucif erase antibodies. Next, the filter 
was washed in TBS and incubated with al/2500 dilution 
of the secondary antibody, goat-anti-rabbit IgG 
conjugated to horseradish peroxidase (Bio-Rad). 
Finally, the filter was washed in TBS and developed 
15 with HRP-Color Developing reagent (Bio-Rad). 

Emission Spectra 

Crude samples of pTZLuc-l cells were prepared 
as described previously in this text. The sample was 
added to 1 ml of luciferase assay buffer; 1 „1 
luciferin (917 nmol/ul) was added at 1-2 minute 
intervals to maintain the signal. The bioluminescence 
emission spectra were obtained and corrected using an 
on-line computerized SPEX fluorimeter. Multiple 
spectra were scan averaged to give the final spectrum 
which was measured from 675 nm to 375 nm. 
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Protein Purification 

Purification of recombinant Renilla luciferase 
from pTZRLuc-1 E. coli extracts was accomplished in 
three chromatographic steps. Recombinant luciferase 
was purified from pTZRLuc-1 cells as follows: pTZRLuc- 
1 cells were grown in 20L Luria broth at 37°c at an 
OD 600 =0 - 6 at wni ch time IPTG was added to a final 
35 concentration of 0.5 mM; cells continued to grow 
overnight at 30°C. The cells were harvested by 
centrifugation, washed in TE, resuspended in 5 ml of 10 
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mM EDTA {pH 8) per gram of cells, and frozen at 
-20«C. In a typical purification, 15 to 30 grams of 
cells were thawed. Lysozyme was added to a final 
concentration of 4 to 6 mg/ml, and the cells were held 
5 on ice for 45 minutes. DNase 1 (10 to 20 mg, was added 
to the lysate which was sonicated on ice with 1 minute 
bursts from a Branson Cell Disrupter until 90% of the 
cells were lysed as evidenced by microscopic 
examination. 

10 The crude material was clarified by 

centrifugation at 48 X g for 30 minutes and loaded onto 

the first column. The extract was first run on a DEAE- 

Cellulose ion-exchange column followed by a G-100 

Sephadex gel filtration column and then a Benzoic Acid- 

15 Sepharose affinity column. The G-100 column was run in 

IX Renilla Standard Buffer (1.5 mM Tris, l.o mM EDTA p H 

7.8). The other columns were run in IX buffer and were 

eluted in 10X buffer (DEAE) or Sodium Benzoate in 10X 

buffer (Benzoic Acid-Sepharose) . The first Benzoic 

Acid column was eluted with 0.1 M sodium benzoate 

pulse. The second Benzoic Acid column was eluted with 

a 0 to 0.5 M sodium benzoate gradient. Protein 

determinations were made by A 28Q measurements using the 

extinction coefficient of native luciferase { e , on 0 11 
= ■) i 1 t_ i » ■ . "SOnm"-- 1 -* 
^--Li, by light emission using the specific activity 

of naive luciferase {l. 8 X 10 15 hv sec" 1 mg"!}, or by 
Bradford assays as described in Bradford, M. , 
Analytical Biochemistry (1976) 72: 248. Absorption 
spectra were measured and collected on a Varian Model 
30 DMS-100 spectrophotometer. 

Isolatio n and Analysis of xrluc-6 

The primary screen of 1 X 10 6 recombinant 
phage resulted in the isolation of nine clones which 
35 gave identical autoradiographic signals on both replica 
filters. Of the nine original positives, only five 
gave signals on the second screening, and only one of 
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the five hybridized to both probes. The other four 
hybridized only to Probe #2, which has the greatest 
sequence redundancy. Restriction enzyme analysis of 
the five clones revealed that xRLuc-3 and xRLuc-8 were 
5 identical and contained a 1.16 kb insert. xRLuc 2, 5, 
and 6 had insert sizes of 0.8, 2.34 and 1.2 kbp 
respectively. Only the XRLuc-3 and XRLuc-8 inserts 
could be exercised from the EcoRI cloning site by EcoRI 
digestion. The other three inserts had apparently lost 
one EcoRI linker site; these had to be cut with EcoRI 
and Sstl. Thus, each of these cDNAs contained 1 kb of 
xgtll DNA attached at one end. Since only XRLuc-6 
hybridized to both oligonucleotide probes and contained 
a cDNA of the size necessary to code for an 
approximately 36 kd protein, it was chosen for DNA 
sequence analysis. 

The 2.2kb EcoRI/SstI fragment, which contained 
1 kb of xgtll lac Z DNA, was subcloned into M13 and 
mpl8 and m P 19 and both strands of the 1.2 kb cDNA were 
20 completely sequenced. The entire cDNA sequence is n 96 
bp, excluding the EcoRI linker (Figure 1). 
Structurally, it contains a putative initiation codon 
beginning at nucleotide 10, a stop codon at nucleotide 
944, a polyadenylation consensus sequence at nucleotide 
1170, and a short polyadenylated tail of seven 
nucleotides (Figure 1). Also shown underlined in 
Figure 1 are the two oligonucleotide hybrization sites 
located at nucleotides 537-554 (Probe #1) and 
nucleotides 820-836 (Probe #2). The loss of the EcoRI 
site at the 3' end of the cDNA was confirmed by the 
sequence analysis. 

The cDNA does not contain a stop codon in 
frame with and upstream from the first initiation codon 
as an indication that the protein coding region is full 
length. However, the coding region directs the 
recombinant systhesis of fully active Renilla 
lucif erase, as discussed below. Translating the cDNA 
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sequence into an amino acid sequence gave conclusive 
evidence that the XRLuc-6 cDNA was a Renilla luciferase 
cDNA. The translated cDNA sequence contains an open 
reading frame of 314 amino acids (Figure 2). The first 
methionine is preceded by three amino acids which may 
or may not be part of the native protein sequence, if 
the in vivo translation begins at the first methionine, 
an open reading frame of 311 amino acids results which 
codes for a protein of molecular weight (M r ) "36 Kd. 
The M r of native Renilla luciferase has been measured 
by various methods with values ranging from 33 Kd to 38 
Kd. Comparing the amino acid composition of this 
translated amino acid sequence with that of the 
previously published native luciferase composition 
shows a very close homology with many identities 
between the two (Table 2). 
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Do^Ti! 10 AC L id Co . m P°sition of Native and 
Recombinant Renilla reniformis Luciferase 



Amino Acids 

Lysine 

Histidine 

Arginine 

Aspartate* 

Threonine 

Serine 

Glutamate** 

Proline 

Glycine 

Alanine 

Valine 

Methionine 

Isoleucine 

Leucine 

Tyrosine 

Phenylalanine 

Tryptophan 

Cysteine 



Residues 
(Native) 

26 
10 
12 
31 
9 
20 
36 
17 
19 
19 
23 
7 
20 
23 
12 
15 
7 
3 



Residues 
i Recombinant- ) 



27 
10 
13 
30 
6 
19 
37 
18 
17 
19 
23 
9 
21 
22 
13 
15 
8 
3 
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eflr 1 B?i f K r *? e , con, P°sition data taken from Matthews 
££ al- , Biochemistry (1977) 16: 85-91. "a"news 
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Further evidence that the cDNA does code for 
luciferase can be seen by comparing the V-8 protease 
peptide sequences with the translated cDNA sequence 
(Figure 3). All v-8 peptides were located on the 
carboxyl-terminal half of the translated coding region 
beginning at residue 161; several overlapped with one 
another. Except at one residue, 219, where the cDNA 
sequence predicts a tryptophan but peptide 6 sequence 
indicates a leucine at the same position, all peptides 
matched exactly to regions of the translated 
sequence. Bunching of the peptides at one end of the 
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protein sequence may be due to the amino-terminal half 
of the native protein being folded in such a way as to 
be inaccessible to V-8 protease. 

5 Expression of Recomb inant Luciferase in E. rnii 

The original XRLuc 6 lysogen showed low levels 
of luciferase activity as determined by light 
emission. iptg induction of xRLuc-6 lysogens led to an 
approximate 50% decrease in activity. This result was 
10 later explained when DNA sequence data revealed that 
the 3 1 end of the cDNA was adjacent to the lac 2 
sequence in xgtll. Therefore, under conditions of i PTG 
induction, transcription was being forced in the wrong 
direction with respect to the luciferase cDNA 
15 orientation. Presumably, the non-induced luciferase 
expression in this construct was due to promoter 
activity from the left end of xgtll at a site which we 
have not determined. 

The construct pTZRLuc-1 was made to simplify 
20 the isolation of DNA fragments for use as probes in 
Southern and Northen analysis (Figure 6). E. C oli 
cells harboring this plasmid are referred to as 
PTZRLuc-1 cells. Similar to xgtll, the pTZ series 
"phagemids" contain a polylinker site adjacent to the 
25 lac Z' gene. Expressed genes in this vector could 

potentially be expressed containing the first 10 to 15 
amino acids of e-galactosidase fused to the cDNA 
translation product. Analysis of pTZRLuc-1 cell 
supernatants for light emission showed that, relative 
30 to XRLuc-6, high levels of luciferase activity were 
present. Furthermore, induction of pTZRLuc-1 cells 
with 0.5 inn iptg led to an increase in luciferase 
activity of - 5-8 fold in crude extracts. 

The bioluminescence emission spectrum from 
5 these crude supernatants was identical to the 

previously published bioluminescence emission spectrum 
for native Renilla luciferase. .The wavelength 
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distribution of light emission i. essentially identical 
to that reported earlier. The spectrum had an emission 
maximum (xmax) at 480 nm with a slight shoulder at 400 
nm, which presumably corresponded to the luciferase- 
oxyluciferin complex neutral species excited state 

The pTZRLuc-1 crude supernatants „ ere further 
characterized by SDS-PAGE . The Coomassie-stained- gel 
contained numerous bands, one of which ran in the 
vicinity of native luciferase. To confirm that this 
band was recombinant luciferse, Western analysis was 
performed using rabbit polyclonal antibodies raised 
against native Renilla luciferase. The developed 
Western showed one band that migrated at the same 
position as native luciferase. No other products 
15 indicative of B-galactosidase-lucif erase fusion 

polypeptide were apparent, suggesting that either any 
putative fusion protein is in too low a concentration 
to be detected or, .ore likely, that no fusion protein 
is made. Though it has not been confirmed by DNA 
sequence analysis, any pTZRLuc-1 translation products 
initiating at the e-galactosidase ATG start codon 
within the first three codons immediately adjacent to 
the first cDNA start codon may explain why we see iptg 
induction of luciferase activity without production of 
a fusion product. 

IPTG induction of recombinant luciferase 
indicates that its transcription is directed by the lac 
Z promoter, since the only candidate ribosome binding 
site (RBS) is probably positioned too far (18 nucleo- 
tides) from the luciferase ATG to be functional, we 
suspect that a e-galactosidase peptide is being 
translated to the stop codon immediately adjacent to 
the luciferase ATG. The translation of a e- 
galactosidase peptide may facilitate ribosome 
reintitiation at the luciferase ATG codon (Figure 7) 
This event could occur if the dinucleotide AG was 
acting as a RBS for the luciferase cDNA. m this way 
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an IPTG inducible, non-fusion lucif erase polypeptide 
could be synthesized. Given the success of recombinant 
lucif erase expression using the pTZ18 vector, which was 
designed as a multi-purpose in vitro transcription 
5 vector rather than an expression vector, it is obvious 
that other clones can be developed which express 
luciferase at levels greater than those which we - 
currently obtain. 

10 Purification of Recomb inant Renilla LuciferasP 
Using the specific activity for native 
luciferase, we made calculations for the amount of 
luciferase present in IPTG induced, pTZRLuc-1 crude 
supernatants and determined that the amount of 
recombinant luciferase being produced was sufficient to 
attempt initial purification on a small scale. 

In IPTG induced pTZRLuc-1 cells, recombinant 
luciferase represents approximately 12-14% of the total 
protein in the clarified crude supernatant. Although 
significant losses of recombinant luciferase were 
suffered in this initial purification, the amount of 
starting material and time involved made the loss seem 
insignificant when compared to the purification of 
native luciferase. The purification scheme for the 
recombinant Renilla luciferase is shown in Figure 8; 
the purification is summarized in Table 3. SDS-PAGE 
analysis of the purification steps shows increasing 
amounts of recombinant luciferase with respect to 
contaminating protein. The Benzoic Acid-Sepharose 
luciferase is approximately 99% pure as evidenced by a 
single band of M r equal to 34 Kd. Very slight 
contamination was noticible on the Coomassie stained 
gel if more than 20 v g of protein were loaded. 
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important feature for the utility of recombinant 
lucif erase in diagnostic applications, many of which 
require incubation at physiological temperatures. 

Purification of recombinant Renilla luciferase 
5 has allowed us to determine its amino-terminal 

sequence. The amino acid sequence of the first 18 
residues was determined by Edman degradation. The 
amino acid peak heights of the sequence data indicated 
that far less protein was actually being sequenced than 
10 was initially protein synthesis, it may be that a large 
percentage of recombinant luciferase is N-formylated at 
the initiating methionine and thus blocked to the Edman 
reaction, in spite of this apparent amino-terminal 
block, enough unblocked species were available that we 
15 were able to obtain sequence for the first 18 residues 
of recombinant luciferase. The amino acid sequence is 
identical to the. translated cDNA sequence from residues 
2 through 18 (Figure 7). On cycle 1 of the amino acid 
sequencing run, we were not able to confirm the 
presence of a methionine at the first residue as 
predicted by the cDNA sequence. However, the fact that 
the two sets of amino acid data are identical from 
amino acid residue 2 (Threonine) to residue 18 
(Proline) strongly supports our assertion that the 
first predicted methionine in our sequence is acting as 
the initiation codon in the pTZRLuc-1 construct. 
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All publications and patent applications 
mentioned in this specification are herein incorporated 
by reference to the same extent as if each individual 
publication or patent application was specifically and 
individually indicated to be incorporated by reference. 

The invention now being fully described, it 
will be apparent to one of ordinary skill in the art 
35 that many changes and modifications can be made thereto 
without departing from the spirit or scope of the 
appended claims. 
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important feature for the utility of recombinant 
lucif erase in diagnostic applications, many of which 
require incubation at physiological temperatures. 

Purification of recombinant Renilla luciferase 
5 has allowed us to determine its amino-terminal 

sequence. The amino acid sequence of the first 18 
residues was determined by Edman degradation. The 
amino acid peak heights of the sequence data indicated 
that far less protein was actually being sequenced than 
was initially protein synthesis, it may be that a large 
percentage of recombinant luciferase is N-f ormylated at 
the initiating methionine and thus blocked to the Edman 
reaction, in spite of this apparent amino-terminal 
block, enough unblocked species were available that we 
were able to obtain sequence for the first 18 residues 
of recombinant luciferase. The amino acid sequence is 
identical to the translated cDNA sequence from residues 
2 through 18 (Figure 7). On cycle 1 of the amino acid 
sequencing run, we were not able to confirm the 
presence of a methionine at the first residue as 
predicted by the cDNA sequence. However, the fact that 
the two sets of amino acid data are identical from 
amino acid residue 2 (Threonine) to residue 18 
(Proline) strongly supports our assertion that the 
first predicted methionine in our sequence is acting as 
the initiation codon in the pTZRLuc-1 construct. 
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All publications and patent applications 
mentioned in this specification are herein incorporated 
by reference to the same extent as if each individual 
publication or patent application was specifically and 
individually indicated to be incorporated by reference. 

The invention now being fully described, it 
will be apparent to one of ordinary skill in the art 
that many changes and modifications can be made thereto 
without departing from the spirit or scope of the 
appended claims. 
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WHAT IS CLAIMED IS : 

1. An isolated DNA or RNA molecule, which 
comprises a nucleotide sequence coding for Renilla 



5 lucife 



rase. 



2. The molecule of Claim 1, wherein said molecul* 
comprises the luciferase coding sequence: 

10 AGCTTAAAGATGACTTCGAAAGTTTATGATCCAGAACAAAGGAAACG6AT 
GATAACTGGTCCGCAGTGGTGGGCCAGATGTAAACAAATGAATGTTCTTG 
ATTCATTTATTAATTATTATGATTCAGAAAAACATGCAGAAAATGCTGTT 
ATTTTTTTACATGGTAACGCGGCCTCTTCTTATTTATGGCGACATGTTGT 
GCCACATATTGAGCCAGTAGCGCGGTGTATTATACCAGATCTTATTGGTA 
15 TGGGCAAATCAGGCAAATCTGGTAATGGTTCTTATAGGTTACTTGATCAT 
TACAAATATCTTACTGCATGGTTTGAACTTCTTAATTTACCAAAGAAGAT 
CATTTTTGTCGGCCATGATTGGGGTGCTTGTTTGGCATTTCATTATAGCT 
ATGAGCATCAAGATAAGATCAAAGCAATAGTTCACGCTGAAAGTGTAGTA 
GATGTGATTGAATCATGGGATGAATGGCCTGATATTGAAGAAGATATTGC 
20 GTTGATCAAATCTGAAGAAGGAGAAAAAATGGTTTTGGAGAATAACTTCT 
TCGTGGAAACCATGTTGCCATCAAAAATCATGAGAAAGTTAGAACCAGAA 
GAATTTGCAGCATATCTTGAACCATTCAAAGAGAAAGGTGAAGTTCGTCG 
TCCAACATTATCATGGCCTCGTGAAATCCCGTTAGTAAAAGGTGGTAAAC 
CTGACGTTGTACAAATTGTTAGGAATTATAATGCTTATCTACGTGCAAGT 
25 GATGATTTACCAAAAATGTTTATTGAATCGGATCCAGGATTCTTTTCCAA 
TGCTATTGTTGAAGGCGCCAAGAAGTTTCCTAATACTGAATTTGTCAAAG 
TAAAAGGTCTTCATTTTTCGCAAGAAGATGCACCTGATGAAATGGGAAAA 
TATATCAAATCGTTCGTTGAGCGAGTTCTCAAAAATGAACAA 

30 or an equivalent DNA or RNA sequence. 

3. The molecule of Claim 2, wherein said .molecule 
is DNA. 

35 4. The molecule of Claim 3, wherein said molecule 

contains said luciferase sequence. 
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5. The molecule of Claim 2, wherein said molecule 
is RNA and contains a sequence equivalent to said 
lucif erase sequence. 

5 

6. The molecule of Claim 1, wherein said sequence 
is preceded by a functional promoter sequence 5' to 
said sequence. 

10 7. The molecule of Claim 6, wherein at least one 

copy of said sequence is present in a recombinant DNA 
or RNA vector. •• 

8. A genetically engineered microorganism, 
15 wherein said microorganism comprises the vector of 

Claim 7. 

9. The microorganism of Claim 8, wherein said " " ~, 
microorganism is an E. coli strain. 
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10. An isolated oligonucleotide, comprising at 
least 10 consecutive nucleotides selected from nucle- 
otide sequence: 

AGCTTAAAGATGACTTCGAAAGTTTATGATCCAGAACAAAGGAAACGGAT 

GATAACTGGTCCGCAGTGGTGGGCCAGATGTAAACAAATGAATGTTCTTG 

ATTCATTTATTAATTATTATGATTCAGAAAAACATGCAGAAAATGCTGTT 

GCCACATATTGAGCCAGTAGCGCGGTGTATTATACCAGATCTTATTGGTA 

TGGGCAAATCAGGCAAATCTGGTAATGGTTCTTATAGGTTACTTGATCAT 

TACAAATATCTTACTGCATGGTTTGAACTTCTTAATTTACCAAAGAAGAT 

CATTTTTGTCGGCCATGATTGGGGTGCTTGTTTGGCATTTCATTATAGCT 

ATGAGCATCAAGATAAGATCAAAGCAATAGTTCACGCTGAAAGTGTAGTA 

GATGTGATTGAATCATGGGATGAATGGCCTGATATTGAAGAAGATATTGC 

GTTGATCAAATCTGAAGAAGGAGAAAAAATGGTTTTGGAGAATAACTTCT 

TCGTGGAAACCATGTTGCCATCAAAAATCATGAGAAAGTTAGAACCAGAA 

GAATTTGCAGCATATCTTGAACCATTCAAAGAGAAAGGTGAAGTTCGTCG 

TCCAACATTATCATGGCCTCGTGAAATCCCGTTAGTAAAAGGTGGTAAAC 
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CTGACGTTGTACAAATTGTTAGGAATTATAATGCTTATCTACGTGCAAGT 

GATGATTTACCAAAAATGTTTATTGAATCGGATCCAGGATTCTTTTCCAA 

TGCTATTGTTGAAGGCGCCAAGAAGTTTCCTAATACTGAATTTGTCAAAG 

TAAAAGGTCTTCATTTTTCGCAAGAAGATGCACCTGATGAAATGGGAAAA 

TATATCAAATCGTTCGTTGAGCGAGTTCTCAAAAATGAACAATAATTACT 

TTGGTTTTTTATTTACATTTTTCCCGGGTTTAATAATATAAATGTCATTT 

TCAACAATTTTATTTTAACTGAATATTTCACAGGGAACATTCATATATGT 

TGATTAATTTAGCTCGAACTTTACTCTGTCATATCATTTTGGAATATTAC 

CTCTTTCAATGAAACTTTATAAACAGTGGTTCAATTAATTAATATATATT 

ATAATTACATTTGTTATGTAATAAACTCGGTTTTATTATAA, 

complementary DNA sequences, and equivalent or 
complementary RNA sequences. 

11. The oligonucleotide of Claim 10, wherein said 
oligonucleotide is labeled with a detectable tag. 



12. The oligonucleotide of Claim 10, wherein said 
oligonucleotide comprises at least 14 consecutive 

20 nucleotides. 

13. A method of identifying genetic material 
encoding a coelenterate luciferase gene, comprising; 

isolating genetic material from a coelenterate 
25 organism to form a sample of genetic material, 

contacting said sample with an oligonucleotide 
of Claim 10 under hybridizing conditions, and 

detecting formation of a duplex comprising 
said oligonucleotide and DNA or RNA present in said 
30 sample. 



14. The method of Claim 13, where said method 
comprises a polymerase chain reaction. 

15. A genetically engineered peptide encoded by a 
nucleotide sequence of Claim 1. 
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16. 



The peptide of Claim 15, wherein said peptide 
is non-glycosylated. 

17. A peptide comprising the amino acid sequence cf 
Figure 3 or a fragment of said sequence comprising at least 
5 consecutive amino acid residues in which the fragment i s 
immunologically reactive with an antibody that specifically 
binds Renillalucif erase, wherein said peptide is free from 
other Renilla peptides. 



10 



15 



20 



25 



30 



35 



OClD:<WO 92<S673A1> 



WO 92/15673 



PCT/US91/01614 



1 / 8 



FIG- I 10 20 30 40 50 fin 

AG CTTAAA6 A TSACTTCG AA AGTTTATGAT CCAG AACAAA GGAAACGGAT GATAACTGGT 

70 80 90 100 110 12Q 

CCGCAGTGGT GGGCCAGATG TAAACAAATG AATGTTCTTG ATTCATTTAT TAATTATTAT 

130 140 150 160 170 inn 

GATTCA6AAA AACATGCAGA AAATGCTGTT ATTTTTTTAC ATGGTAACGC GGCCTCTTCT 

190 200 210 220 230 240 

TATTTAT66C GACAT6TTGT 6CCACATATT GAGCCAGTAG CGCGGTGTAT TATACCAGAT 

250 260 270 280 290 300 

CTTATTGGTA TGGGCAAATC AG6CAAATCT GGTAATGGTT CTTATAGGTT ACTTGATCAT 

310 320 330 340 350 360 

TACAAATATC TTACT6CAT6 GTTTGAACTT CTTAATTTAC CAAAGAA6AT CATTTTTGTC 

370 380 390 400 410 420 

GGCCATGATT GGG6TGCTTG TTTGGCATTT CATTATACCT ATGAGCATCA AGATAAGATC 

430 440 450 460 470 480 

AAAGCAATA6 TTCACGCT6A AAGTGTAGTA GATGTGATTG AATCAT6GGA TGAATGGCCT 

490 500 510 520 530 540 

6ATATTGAAG AAG ATATTGC GTTGATCAAA TCTGAAGAAG GA6AAAAAAT GGTTTT6GAG 

550 560 570 580 590 600 

A.ATAACTTCT TCPT6GAAAC catgttgcca tcaaaaatca tgagaaagtt agaaccagaa 

610 620 630 640 650 660 

GAATTTGCAG CATATCTTGA ACCATTCAAA GAGAAAGGTG AAGTTCGTCG TCCAACATTA 

670 680 690 700 710 720 

TCAT6GCCTC GTGAAATCCC GTTAGTAAAA G6T6GTAAAC CTGACGTTGT ACAAATTGTT 

730 740 750 760 770 780 

AGGAATTATA ATGCTTATCT ACGTGCAA6T GATGATTTAC CAAAAATGTT TATTGAATCG 

790 800 810 820 830 840 

GAT CCAG 6 AT TCTTTTCCAA TGCTATTGTT GAAG6CGCC A AG AA6TTTCC TAATArT BAA 

850 860 870 880 890 900 

TTT6TCAAAG TAAAAGGTCT TCATTTTTCG CAAGAAG ATG CACCTGAT6A AATGG6AAAA 

910 920 930 940 950 960 

TATATCAAAT CGTTCGTTGA GCGAGTTCTC AAAAATGAAC AATAATTACT TTGGTTTTTT 

970 980 990 1000 1010 1020 

ATTTACATTT TTCCCGGGTT TAATAATATA AATGTCATTT TCAACAATTT TATTTTAACT 

1030 1040 1050 1060 1070 1080 

GAATATTTCA CAG6GAACAT TCATATATGT TGATTAATTT AGCTCGAACT TTACTCTGTC 

1090 1100 1110 1120 1130 1140 

ATATCATTTT GGAATATTAC CTCTTTCAAT 6AAACTTTAT AAACAGTGGT TCAATTAATT 

1150 1160 1170 1180 1190 

AATATATATT ATAATTACAT TTGTTATGTA ATAAACTCGG TTTTATTATA AAAAAA 
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FIG. 2 

10 

Ser Leu Lys Net Thr Ser Lys Val Tyr Asp Pro Glu Gin Arg Lys Arg Met He Thr Gly 

30 

Pro Gin Trp Trp Ala Arg Cys Lys Gin Met Asn Val Leu Asp Ser Phe He Asn Tyr Tyr 

50 

Asp Ser Glu Lys His Ala Glu Asn Ala Val He Phe Leu His Gly Asn Ala Ala Ser sir 

Tyr Leu Trp Arg His Val Val Pro His IU Glu Pro Val Ala Arg Cys lie He Pro As? 

90 

Leu He Gly net Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp Jls° 
Tyr Lys Tyr Leu Thr Ala Trp Phe Glu Leu" Leu Asn Leu Pro Lys Ly, IT lie Phe Val 
Sly Hi, Asp Trp Gly Ala Cys Leu Ala III His Tyr Ser Tyr Glu His Gin Asp Lys iff 
Lys Ala He Val His Ala Glu Ser Val Val Asp Val He Glu Ser Trp Asp Glu Trp Pr? 
Asp he Glu Glu Asp II. Ala Leu He lJ! Ser Glu Glu Gly Glu Lys Met Val Leu 18 ° 



Glu 



190 

Asn Asn Phe Phe Val Glu Thr Het Leu Pro Ser Lys IU Met Arg Lys Leu Glu Pro l?u° 

210 



Glu Phe Ala Ala Tyr Leu Glu Pro Ph. Lys Glu Lys Gly Glu Val Arg Arg Pro Thr 



230 



220 
Leu 

240 



Ser Trp Pro Arg Glu He Pro Leu Val Lys Gly Gly Ly, Pro Asp Val Val 61n He Si 



250 

Arg Asn Tyr Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lys Net Phe He Glu I" 

270 



Asp Pro Gly Phe Phe Ser Asn Ala II. v"al Glu Gly Ala Ly, Ly, Phe Pro A,n Thr GlJ 



290 



Phe Val Lys Val Lys Gly Leu His Phe llr Gin Glu Asp A-l. Pro Asp Glu Het Gly Ly? 



Tyr lie Lys Ser Phe Val Glu Arg Val Leu Lys Asn Glu Gin 
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1) 6LU-6LV-RLR 



LYS-LYS-PHE- PRO-flSN-THH-lGLU 



2) SLU-AR6-UHL-LEIM.YS-ASN-6LU 

3) 6LU-6LY-6LU-LYS-MET-URL-LEU-6LU 

4) GLU-flSP-l LE-RLR-LEU- I LE-LY$-$ER-GLU 

5) GLU-$ER-fl$P-PR0-6LY-PHE-PHE-$ER-fl$N-ILE-URL-6LU 

6) GLU-LY$-6LY-6LU-UflL-flRe-flR6-PR0-THR-LEU-SER-LEU-PR0-flRG- 
GLU-ILE-PR0-LEU-URL-LYS-6LY 



7) 



GLU-RSN-RSN-PHE-PHE-URL- 6LU 



|glu-bsn-bsn-phe-phe-url j 



Luciferase Probe #1: 6 R R - R RT- R RT-TTT-nT- GT 
(32 degeneracies) 6 C C C C 



LVS-LYS-PHE- PRO- RSN-THR 

Luciferase Probe #2: Rflfl-Rflfl-TTT-CCT-HRT-flC 
(64 degeneracies) 6 6 C C C 

fl 
6 



FIG. 4 
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pTZRLuc- 1 



Direction of translation ^ 

lac r RBS lac T start codon 

6TGT66AATT6T6A6Ce6ATAACAATTTCACACASfiAAACA6CTAIfiACCAT6 
putative RBS luclferase cDNA "start" codon 

} I 

ATTACeAATTTAATAC6ACTCACTATA6G6AATTCA£CTIAAA6AiaACTTC6 

I 

lac T In frame stop codon 

AAAGTTTAT6ATCCA6AA 



RBS - ribosome binding site 

FIG. 7 
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pTZRLuc-1 cells in 

V 

Centrlfugatlon 48 x 



10mM EDTA pH8 

Freeze thaw 
once, lysozyme, 
DNase I. 
sonication 

g, 30 min 



discard pellet 



DEAE Sephadex 

Run fn IX.elute in 
10X Standard 
Buffer, pool 
activity, 
concentrate by 
T Amicon Filtration 
G-100 Sephadex - 

Run in 1X 
Standard Buffer, 
pool activity, 
concentrate by 
T Amicon filtration 

Benzoic Acid Sepharose 1 



Elute with 0.1 M 
Sodium Benzoate in 
1 0X Standard Buffer, 
dialyze against IX 
Standard Buffer 



Benzoic Acid Sepharose 2 

Elute with 0 to 0.5M 
Sodium Benzoate 
gradient in 10X 
Standard Buffer, 
dialyze against IX 
Standard Buffer 
Amicon Filtration 



Pure Recombinant Luciferase 
FIG.JB ;; 
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